NSF PAR Search | NSF Public Access Repository

BMQSim: Overcoming Memory Constraints in Quantum Circuit Simulation with a High-Fidelity Compression Framework

Zhang, Boyuan; Fang, Bo; Ye, Fanjiang; Guo, Luanzheng; Song, Fengguang; Tallent, Nathan; Tao, Dingwen (June 2025, ACM)

Free, publicly-accessible full text available June 9, 2026
GFormer: Accelerating Large Language Models with Optimized Transformers on Gaudi Processors

Zhang, Chengming; Ding, Xinheng; Sun, Baixi; Yu, Xiaodong; Zheng, Weijian; Xie, Zhen; Tao, Dingwen (December 2024, arXiv.org)

Full Text Available
LCP: Enhancing Scientific Data Management with Lossy Compression for Particles

https://doi.org/10.1145/3709700

Zhang, Longtao; Li, Ruoyu; Ren, Congrong; Di, Sheng; Liu, Jinyang; Huang, Jiajun; Underwood, Robert; Grosset, Pascal; Tao, Dingwen; Liang, Xin; et al (February 2025, Proceedings of the ACM on Management of Data)

Many scientific applications opt for particles instead of meshes as their basic primitives to model complex systems composed of billions of discrete entities. Such applications span a diverse array of scientific domains, including molecular dynamics, cosmology, computational fluid dynamics, and geology. The scale of the particles in those scientific applications increases substantially thanks to the ever-increasing computational power in high-performance computing (HPC) platforms. However, the actual gains from such increases are often undercut by obstacles in data management systems related to data storage, transfer, and processing. Lossy compression has been widely recognized as a promising solution to enhance scientific data management systems regarding such challenges, although most existing compression solutions are tailored for Cartesian grids and thus have sub-optimal results on discrete particle data. In this paper, we introduce LCP, an innovative lossy compressor designed for particle datasets, offering superior compression quality and higher speed than existing compression solutions. Specifically, our contribution is threefold. (1) We propose LCP-S, an error-bound aware block-wise spatial compressor to efficiently reduce particle data size while satisfying the pre-defined error criteria. This approach is universally applicable to particle data across various domains, eliminating the need for reliance on specific application domain characteristics. (2) We develop LCP, a hybrid compression solution for multi-frame particle data, featuring dynamic method selection and parameter optimization. It aims to maximize compression effectiveness while preserving data quality as much as possible by utilizing both spatial and temporal domains. (3) We evaluate our solution alongside eight state-of-the-art alternatives on eight real-world particle datasets from seven distinct domains. The results demonstrate that our solution achieves up to 104% improvement in compression ratios and up to 593% increase in speed compared to the second-best option, under the same error criteria.
Free, publicly-accessible full text available February 10, 2026
COMPSO: Optimizing Gradient Compression for Distributed Training with Second-Order Optimizers

https://doi.org/10.1145/3710848.3710852

Sun, Baixi; Liu, Weijin; Pauloski, J Gregory; Tian, Jiannan; Jia, Jinda; Wang, Daoce; Zhang, Boyuan; Zheng, Mingkai; Di, Sheng; Jin, Sian; et al (February 2025, ACM)

Free, publicly-accessible full text available February 28, 2026
A High-Quality Workflow for Multi-Resolution Scientific Data Reduction and Visualization

https://doi.org/10.1109/SC41406.2024.00091

Wang, Daoce; Grosset, Pascal; Pulido, Jesus; Athawale, Tushar M; Tian, Jiannan; Zhao, Kai; Lukić, Zarija; Huebl, Axel; Wang, Zhe; Ahrens, James; et al (November 2024, IEEE)

Full Text Available
Accelerating Communication in Deep Learning Recommendation Model Training with Dual-Level Adaptive Lossy Compression

https://doi.org/10.1109/SC41406.2024.00095

Feng, Hao; Zhang, Boyuan; Ye, Fanjiang; Si, Min; Chu, Ching-Hsiang; Tian, Jiannan; Yin, Chunxing; Deng, Summer; Hao, Yuchen; Balaji, Pavan; et al (November 2024, IEEE)

Full Text Available
CUSZ-i: High-Ratio Scientific Lossy Compression on GPUs with Optimized Multi-Level Interpolation

https://doi.org/10.1109/SC41406.2024.00019

Liu, Jinyang; Tian, Jiannan; Wu, Shixun; Di, Sheng; Zhang, Boyuan; Underwood, Robert; Huang, Yafan; Huang, Jiajun; Zhao, Kai; Li, Guanpeng; et al (November 2024, IEEE)

Full Text Available
A Survey on Error-Bounded Lossy Compression for Scientific Datasets

https://doi.org/10.1145/3733104

Di, Sheng; Liu, Jinyang; Zhao, Kai; Liang, Xin; Underwood, Robert; Zhang, Zhaorui; Shah, Milan; Huang, Yafan; Huang, Jiajun; Yu, Xiaodong; et al (May 2025, ACM Computing Surveys)

Error-bounded lossy compression has been effective in significantly reducing the data storage/transfer burden while preserving the reconstructed data fidelity very well. Many error-bounded lossy compressors have been developed for a wide range of parallel and distributed use cases for years. They are designed with distinct compression models and principles, such that each of them features particular pros and cons. In this paper we provide a comprehensive survey of emerging error-bounded lossy compression techniques. The key contribution is fourfold. (1) We summarize a novel taxonomy of lossy compression into 6 classic models. (2) We provide a comprehensive survey of 10 commonly used compression components/modules. (3) We summarize the pros and cons of 47 state-of-the-art lossy compressors and present how state-of-the-art compressors are designed based on different compression techniques. (4) We discuss how customized compressors are designed for specific scientific applications and use-cases. We believe this survey is useful to multiple communities including scientific applications, high-performance computing, lossy compression, and big data.
Free, publicly-accessible full text available May 2, 2026
Centimani: Enabling Fast AI Accelerator Selection for DNN Training with a Novel Performance Predictor

Xie, Zhen; Emani, Murali; Yu, Xiaodong; Tao, Dingwen; He, Xin; Su, Pengfei; Zhou, Keren; Vishwanath, Venkatram (July 2024, 2024 USENIX Annual Technical Conference (USENIX ATC 24))

Full Text Available
Multifacets of lossy compression for scientific data in the Joint-Laboratory of Extreme Scale Computing

https://doi.org/10.1016/j.future.2024.05.022

Cappello, Franck; Acosta, Mario; Agullo, Emmanuel; Anzt, Hartwig; Calhoun, Jon; Di, Sheng; Giraud, Luc; Grützmacher, Thomas; Jin, Sian; Sano, Kentaro; et al (February 2025, Future Generation Computer Systems)

Free, publicly-accessible full text available February 1, 2026